Abstract: Data characterized by volume, velocity, variety, and
veracity challenge the traditional tools used to extract useful
analysis for decision-making. By integrating high-dimensional analysis
with visualization, this paper develops a 3D power-map animation
as an effective response to that challenge. An architecture design,
with a detailed data-processing procedure, is proposed to realize
the integration. Two of the most important components of the
architecture are presented: the Single-Ring Law for random
matrices, which provides a solid mathematical foundation, and the
proposed statistic MSR, which supplies the high-dimensional data
for visualization. The whole procedure is simple in logic, fast,
objective, and robust against bad data. Moreover, it is an
unsupervised machine learning mechanism oriented directly to the
raw data rather than to logic or models based on simplifications
and assumptions. A case study validates the effectiveness and
performance of the developed 3D power-map in analysis extraction.

Index Terms: Big data, smart grid, high-dimensional analysis,
3D power-map, visualization, random matrix, mean spectral
energy radius

I. Introduction

Big data analysis and its visualization are efficient solutions
to the data explosion in smart grids. The rapid development of
power systems leads to more challenging datasets. We can hardly
handle data with features of volume, velocity, variety, and
veracity (i.e., 4Vs data) within a tolerable elapsed time or with
tolerable hardware resources. This challenge has encouraged the
development of an emerging field: big data analysis and its
visualization. This field aims to extract analysis directly from
the multivariate and multidimensional raw data from a
high-dimensional perspective. In other words, big data
high-dimensional analysis and its visualization establish and
illustrate the inherent correlations among the raw data to help
understand the system and to gain insight into its mechanism,
rather than building and analyzing models based on simplifications
and assumptions.

It is generally believed that in the coming century, big data
high-dimensional analysis will be a very significant activity.
It has been applied successfully as a powerful tool in specific
fields for numerous physical phenomena, such as quantum systems,
financial systems, and biological systems, as well as wireless
communication networks. We therefore expect applications of big
data high-dimensional analysis to appear in smart grids as they
have in other fields. Visualization is an essential issue in power
system operation, forecasting, fault detection, and even economic
dispatch. It is a key tool for extracting analysis from the
overwhelming amount of data to support decision-making. Much work
has previously been done on developing visualization techniques
that aid in interpreting power system data. In one such work, two
kinds of visualization methods are used to visualize the state
variables of the system, namely the voltage magnitudes and phase
angles of all the buses: color contouring is used for voltage
magnitude, while the varying height of the underlying terrain
represents the phase angles. The 4Vs data in smart grids, however,
are hard to visualize with traditional tools. One of the major
challenges is to find an effective way to convert the
high-dimensional data into low-dimensional geometry as a
man-machine interface.

This paper integrates high-dimensional analysis with
visualization to develop a 3D power-map animation. First, the
single-ring law for random matrices is introduced as the
mathematical foundation. Then an architecture is proposed as a
universal solution to realize the integration. In the architecture,
the Mean Spectral energy Radius (MSR) is defined as a new
high-dimensional statistic to visualize the data correlations.
In addition, MSR clarifies the information that should be
exchanged among the distributed partitions for analysis
extraction. A case study is presented to show the effectiveness
and performance of the developed 3D power-map with the
proposed statistic MSR.

II. Mathematical Theory and Data Procedure 
A. The Single-Ring Law 

Consider the matrices product $Z = \prod_{i=1}^{L} X_{u,i}$, where
$X_u \in \mathbb{C}^{N \times N}$ is the singular value equivalent of the
rectangular $N \times T$ non-Hermitian random matrix $\tilde{X}$, whose
entries are independent identically distributed (i.i.d.) variables
with mean $\mu(\tilde{x}_k) = 0$ and variance $\sigma^2(\tilde{x}_k) = 1$ for
$k = (1, 2, \dots, N)$. The matrices product $Z$ can be converted to
$\tilde{Z}$ by a transform which makes the variance
$\sigma^2(\tilde{z}_k) = 1/N$ for $k = (1, 2, \dots, N)$. Thus, the empirical
eigenvalue distribution of $\tilde{Z}$ converges almost surely to the
limit given by

$\rho_{\mathrm{ring}}(\lambda) = \begin{cases} \dfrac{1}{\pi c L} |\lambda|^{(2/L - 2)}, & (1 - c)^{L/2} \le |\lambda| \le 1 \\ 0, & \text{otherwise} \end{cases}$   (1)

as $N, T \to \infty$ with the ratio $N/T = c \in (0, 1]$. On the complex
plane of the eigenvalues, the inner circle radius
$r_{\min,\lambda(\tilde{Z})}$ is $(1 - c)^{L/2}$ and the outer circle radius is
unity. In addition, we propose the mean spectral energy radius as
a statistic, MSR, to characterize the distribution. Figure 1 shows
the cases of $L = 1$ and $L = 8$, respectively.



Fig. 1: Eigenvalues $\lambda(\tilde{Z})$ for a product of $L$ non-Hermitian
random matrices with $N = 400$, $T = 1000$. Panel (a) is for $L = 1$, and
panel (b) is for $L = 8$. The red and green lines show the inner radius
and the mean spectral energy radius, respectively.
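
As a worked check on the ring boundaries for the two cases of Fig. 1, the inner radius $(1 - c)^{L/2}$ of the single-ring law can be evaluated directly from the values quoted in the caption ($N = 400$, $T = 1000$). The outer radius is 1 in both cases; the rounding to three decimals is ours.

```latex
\begin{align*}
c &= \frac{N}{T} = \frac{400}{1000} = 0.4 \\
r_{\min}\big|_{L=1} &= (1 - 0.4)^{1/2} = \sqrt{0.6} \approx 0.775 \\
r_{\min}\big|_{L=8} &= (1 - 0.4)^{8/2} = 0.6^{4} \approx 0.130
\end{align*}
```

The ring for $L = 8$ is therefore much wider than the ring for $L = 1$, so the eigenvalues of the eight-fold product may lie much closer to the origin.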


B. From Power Systems to Single-Ring Law 

For a power system, the raw data $\hat{x}$ at a certain time can be
arranged as a vector. As time goes by, vectors are acquired one by
one, and a sheet is naturally formed as a data set that maps the
system. Any arbitrary section of this set can be selected as the
raw data source $\Omega_x$ for further analysis. Thus, $\Omega_x$
consists of sample vectors at a series of times, denoted
$\hat{x}(t_1), \hat{x}(t_2), \dots, \hat{x}(t_i), \dots$; at any time $t_i$, the
vector $\hat{x}(t_i)$ comprises sample data denoted
$\hat{x}_1, \hat{x}_2, \dots, \hat{x}_n$. The length $n$ is decided by the
varieties of data at a single sampling time, and the length $t$ is
subject to the volume of the historical data set and is generally
large enough.

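For the raw data source $\Omega_x$, we can focus on any data
area as a split-window to form a raw matrix $\hat{X}$. Then, we
convert it to a standard rectangular non-Hermitian matrix $\tilde{X}$.
The conversion is performed row by row with the following formula:

$\tilde{x}_{i,j} = (\hat{x}_{i,j} - \mu(\hat{x}_i)) \times (\sigma(\tilde{x}_i) / \sigma(\hat{x}_i)) + \mu(\tilde{x}_i)$,  $1 \le i \le N$; $1 \le j \le T$   (2)

where $\hat{x}_i = (\hat{x}_{i,1}, \hat{x}_{i,2}, \dots, \hat{x}_{i,T})$ and
$\mu(\tilde{x}_i) = 0$, $\sigma^2(\tilde{x}_i) = 1$.

The matrix $X_u \in \mathbb{C}^{N \times N}$ is introduced as the singular value
equivalent of the matrix $\tilde{X} \in \mathbb{C}^{N \times T}$ by

$X_u = U \sqrt{\tilde{X} \tilde{X}^H}$   (3)

where $U \in \mathbb{C}^{N \times N}$ is a unitary matrix and
$X_u^H X_u = \tilde{X} \tilde{X}^H$.

Then, the matrices product $Z = \prod_{i=1}^{L} X_{u,i}$ is acquired,
based on which $\tilde{Z}$ is calculated row by row with the following
formula:

$\tilde{z}_i = z_i / (\sqrt{N} \sigma(z_i))$   (4)

where $z_i = (z_{i,1}, z_{i,2}, \dots, z_{i,N})$ and
$\tilde{z}_i = (\tilde{z}_{i,1}, \tilde{z}_{i,2}, \dots, \tilde{z}_{i,N})$.

The above procedure of variable transformation is depicted
as follows. The matrix $\tilde{Z}$ is calculated for the single-ring law,
and the mean value of all the eigenvalues' radii, $\kappa_{\mathrm{MSR}}$, is
proposed as a new high-dimensional statistic.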


Steps of Variables Transformation 

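1) raw matrix $\hat{X} \in \mathbb{C}^{N \times T}$

2) standard rectangular matrix $\tilde{X} \in \mathbb{C}^{N \times T}$, $\mu(\tilde{x}_i) = 0$, $\sigma^2(\tilde{x}_i) = 1$

3) singular value equivalent $X_u = U \sqrt{\tilde{X} \tilde{X}^H} \in \mathbb{C}^{N \times N}$

4) matrices product $Z = \prod_{i=1}^{L} X_{u,i} \in \mathbb{C}^{N \times N}$

5) $\tilde{Z} \in \mathbb{C}^{N \times N}$, $\mu(\tilde{z}_i) = 0$, $\sigma^2(\tilde{z}_i) = 1/N$

$\kappa_{\mathrm{MSR}} = \bar{r}_{\lambda(\tilde{Z})}$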
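
The five steps above can be sketched in Python with NumPy. This is a minimal sketch under stated assumptions, not the authors' implementation: the unitary matrix $U$ is drawn from the Haar measure by a QR decomposition, the $L$ factors of the product reuse the same standardized window with independent draws of $U$, and all function names and the white-noise test window are hypothetical.

```python
import numpy as np


def standardize_rows(x_raw):
    """Step 2: scale every row to zero mean and unit variance, as in Eq. (2)."""
    mu = x_raw.mean(axis=1, keepdims=True)
    sigma = x_raw.std(axis=1, keepdims=True)
    return (x_raw - mu) / sigma


def haar_unitary(n, rng):
    """Draw an n x n unitary matrix from the Haar measure via QR (assumption)."""
    g = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    q, r = np.linalg.qr(g)
    d = np.diag(r)
    return q * (d / np.abs(d))  # fix the column phases left free by QR


def singular_value_equivalent(x_std, rng):
    """Step 3: X_u = U sqrt(X X^H), so that X_u^H X_u = X X^H."""
    w, v = np.linalg.eigh(x_std @ x_std.conj().T)
    root = (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T
    return haar_unitary(x_std.shape[0], rng) @ root


def normalize_product(z):
    """Step 5: rescale each row of Z so its standard deviation is 1/sqrt(N)."""
    n = z.shape[0]
    return z / (np.sqrt(n) * z.std(axis=1, keepdims=True))


def mean_spectral_radius(x_raw, n_factors=1, seed=0):
    """Steps 1-5, then the MSR: the mean modulus of the eigenvalues."""
    rng = np.random.default_rng(seed)
    x_std = standardize_rows(np.asarray(x_raw, dtype=float))
    z = np.eye(x_std.shape[0], dtype=complex)
    for _ in range(n_factors):  # step 4: product of L singular value equivalents
        z = z @ singular_value_equivalent(x_std, rng)
    eigenvalues = np.linalg.eigvals(normalize_product(z))
    return float(np.abs(eigenvalues).mean()), eigenvalues


# Hypothetical white-noise window with N = 40 rows and T = 100 columns.
noise = np.random.default_rng(1).standard_normal((40, 100))
msr, eig = mean_spectral_radius(noise)
print("MSR:", round(msr, 3), "inner radius:", round(float(np.sqrt(1 - 40 / 100)), 3))
```

For a white-noise window with $L = 1$, the printed MSR would be expected to lie between the inner radius and the unit outer radius of the ring.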


C. Universal Architecture and its Advantages 

Based on the Single-Ring Law and the variable transformation
above, we design the following steps as the general architecture
for conducting analysis and visualization in this paper.


Steps of High-dimensional Analysis and its Visualizations 

1) Form $\tilde{Z}$ for any time $t_i$ as described in Section II

2) Calculate the eigenvalues $\lambda(\tilde{Z})$

3) Plot the single ring

4) Conduct high-dimensional analysis

4a) Observe and compare the results with the standard ring
4b) Calculate the statistic $\kappa_{\mathrm{MSR}}$

5) Conduct visualization

6) Conduct interpretations
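
Steps 4a) and 4b) can be illustrated with a short Python sketch that takes a list of eigenvalues and compares it with the standard ring. The function names, the returned fields, and the four sample eigenvalues are hypothetical; only the inner radius $(1 - c)^{L/2}$, the unit outer radius, and the sizes $N = 118$, $T = 240$ come from the text.

```python
def ring_inner_radius(n, t, n_factors=1):
    """Inner radius (1 - c)^(L/2) of the standard ring, with c = N / T."""
    return (1.0 - n / t) ** (n_factors / 2.0)


def ring_summary(eigenvalues, n, t, n_factors=1):
    """Steps 4a and 4b: compare the spectrum with the ring and compute the MSR."""
    radii = [abs(ev) for ev in eigenvalues]
    r_min = ring_inner_radius(n, t, n_factors)
    inside = sum(1 for r in radii if r < r_min)
    outside = sum(1 for r in radii if r > 1.0)
    return {
        "inner_radius": r_min,
        "msr": sum(radii) / len(radii),
        "fraction_inside_inner_circle": inside / len(radii),
        "fraction_outside_unit_circle": outside / len(radii),
    }


# Hypothetical eigenvalues: three on the ring, one collapsed toward the center.
sample = [0.9 + 0.0j, -0.6 + 0.6j, 0.0 - 0.85j, 0.1 + 0.1j]
summary = ring_summary(sample, n=118, t=240)
print(summary)
```

A growing fraction of eigenvalues inside the inner circle, together with a falling MSR, is the kind of deviation from the standard ring that step 4a) looks for.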


With a purely statistical procedure, steps 1)-4) of the proposed
architecture enable us to conduct high-dimensional analysis of the
interrelations and interactions among the raw data, which appear
as correlations. Based on these correlations, we extract analysis
directly from the raw data, without the simplifications and
assumptions required by an engineering model or causal logic. As
a result, the procedure is simpler in logic, faster, and objective,
without introducing or accumulating errors. Moreover, it is
an unsupervised machine learning mechanism oriented directly to
a matrix consisting of the 4Vs raw data mentioned in Section I.
In addition, owing to its high-dimensional nature, the analysis is
more robust against incomplete, inaccurate, and unavailable data.
These advantages are detailed in the case study.


III. Case Study 

Figure 2 shows the standard IEEE 118-bus system with
six partitions [24]. Detailed information about the test bed can
be found in case118.m in the MATPOWER package and in the
MATPOWER 4.1 User's Manual. Table I applies to this case.

There are generally two scenarios in this case: 1) only white
noise, such as small random fluctuations of loads or sampling
errors; 2) signals plus noise, such as sudden changes or faults
at a certain bus or in the grid. Table I shows the series of
assumed events. The distribution of eigenvalues $\lambda(\tilde{Z})$ in the
single ring at $t_s$ = 300 s and at $t_s$ = 301 s, and the MSR over the
time series, are illustrated by Figures 3a, 3b, and 3c, respectively.

TABLE I: Series of Events

Par. \ A_Time (s)        [001 : 300]    [301 : 700]    [701 : 1000]
$P_{\text{bus-22}}$ (MW)    0              200            t - 500

*$P_{\text{bus-22}}$ is the power demand of bus 22.
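
The event series of Table I can be written as a small piecewise function. This is a sketch under our reading of the table (0 MW up to 300 s, 200 MW from 301 s to 700 s, and $t - 500$ MW from 701 s to 1000 s); the function name `p_bus22` is hypothetical.

```python
def p_bus22(t):
    """Power demand of bus 22 in MW at sampling time t (s), as read from Table I."""
    if not 1 <= t <= 1000:
        raise ValueError("Table I only covers t = 1 ... 1000 s")
    if t <= 300:
        return 0.0           # noise-only interval
    if t <= 700:
        return 200.0         # sudden change at t = 301 s
    return float(t - 500)    # steady growth of demand


# Demand just before and after each boundary of the table.
for t in (300, 301, 700, 701, 1000):
    print(t, p_bus22(t))
```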
Fig. 2: Partitioning network for the IEEE 118-bus system. There are
six partitions, i.e., A1, A2, A3, A4, A5, and A6.



Fig. 3: Ring Law and MSR at Different Sampling Times. (a) Sampling
time $t_s$ = 300 s; (b) sampling time $t_s$ = 301 s; (c) MSR on time
series (horizontal axis: sampling time in s; panel annotation:
Tlength = 240, N = 118, Ver = 24).


At sampling time $t_s$ = 300 s, the data cover the time area
$A_{\mathrm{Time}}$ = 61:300 s, i.e., a window of $T$ = 240 samples. The noise
plays a dominant part, and the distribution of eigenvalues
$\lambda(\tilde{Z})$ is close to the standard ring of $L = 1$. At sampling
time $t_s$ = 301 s, Figure 3b shows that the eigenvalues collapse
toward the center point of the circle.

From the $\kappa_{\mathrm{MSR}}$-$t$ curve in Figure 3c, it is observed that
$\kappa_{\mathrm{MSR}}$ starts to decrease (0.8019, 0.6789, ..., 0.5240) right at
$t$ = 301 s, when the sudden change occurs as a signal. This
decrease lasts for half of the window length (i.e., $T/2$ = 120 s).
The statistic $\kappa_{\mathrm{MSR}}$ is a high-dimensional statistic, and the
$\kappa_{\mathrm{MSR}}$-$t$ curve plotted from it is sensitive to the event. The
$\kappa_{\mathrm{MSR}}$-$t$ curve of each partition can be plotted in the same
way. Both the voltage V and the high-dimensional statistic
$\kappa_{\mathrm{MSR}}$ can be obtained from the raw data V by a statistical
procedure without any engineering model. The former is oriented
to a single point, whose value is fully decided by one raw datum
at a single sampling time; the latter is related to a
high-dimensional matrix, whose value is decided by all matrix
entries, which consist of raw data of $N$ varieties over $T$ sampling
times.

The visualization builds on the statistical analysis above.
It leads to the conclusion that combining high-dimensional
analysis with visualization performs better in monitoring the
status and trends of the power system.

With an interpolation method [26], a 3D power-map of the data
is plotted. Figures 4 and 5 depict some key frames of the 3D
power-map animation of the high-dimensional statistic
$\kappa_{\mathrm{MSR}}$ and of the raw data V, respectively. Comparing Figures 4
and 5, it is concluded that the high-dimensional statistic
$\kappa_{\mathrm{MSR}}$ is sensitive to the events and performs better in
monitoring the status and trend:

a) At time $t$ = 301 s, the area around A2 in the power-map
changes. Therefore, we conjecture that some event occurs at A2.
The behavior during $t$ = 302:420 s supports this conjecture.
Moreover, we can conjecture that the event is more influential
in the areas A2, A3, and A5 and has no effect on A6; this
coincides with the fact that there is a sudden change of
$P_{\text{bus-22}}$ at $t$ = 301 s.

b) With sustained growth of power demand at some bus
($P_{\text{bus-22}}$), the whole system becomes more and more vulnerable.
The vulnerability can be estimated from the visualization of
$\kappa_{\mathrm{MSR}}$.

c) Moreover, if the most important data (i.e., the raw data V
for A2) are lost somehow, hardly any valuable information can
be obtained from V, as in Figure 7, whereas the status and trend
of the whole system can still be estimated from $\kappa_{\mathrm{MSR}}$, as in
Figure 6.

d) Finally, we conjecture that low-dimensional statistics,
such as the raw data at a single time, the 1-dimensional statistic
mean $\mu$, and the 2-dimensional statistic variance $\sigma^2$, cannot
reflect the status and trend as well as the high-dimensional
statistic $\kappa_{\mathrm{MSR}}$ does.
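
One frame of such a power-map can be sketched by interpolating one value per partition onto a regular grid and using the result as the surface height. The text does not specify the interpolation method, so inverse-distance weighting is used here purely as an assumption; the partition coordinates and the per-partition values are hypothetical and are not results of the case study.

```python
def idw_surface(points, values, grid_x, grid_y, power=2.0):
    """Inverse-distance-weighted height at every (x, y) grid node."""
    surface = []
    for gy in grid_y:
        row = []
        for gx in grid_x:
            num = den = 0.0
            for (px, py), val in zip(points, values):
                d2 = (gx - px) ** 2 + (gy - py) ** 2
                if d2 == 0.0:  # grid node coincides with a data point
                    num, den = val, 1.0
                    break
                w = d2 ** (-power / 2.0)
                num += w * val
                den += w
            row.append(num / den)
        surface.append(row)
    return surface


# Hypothetical layout: one (x, y) center and one MSR value per partition A1..A6.
centers = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (0.0, 1.0), (1.0, 1.0), (2.0, 1.0)]
msr_frame = [0.80, 0.52, 0.70, 0.78, 0.68, 0.80]
xs = [i * 0.5 for i in range(5)]  # 0.0 ... 2.0
ys = [j * 0.5 for j in range(3)]  # 0.0 ... 1.0
frame = idw_surface(centers, msr_frame, xs, ys)
for row in frame:
    print([round(v, 3) for v in row])
```

Repeating this for every sampling time, with the MSR of each partition as input, would give the sequence of frames for an animation; a plotting library could then render each frame as a 3D surface.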

IV. Conclusion 

This paper develops a 3D power-map animation by integrating
high-dimensional analysis and visualization. For the former, we
introduced the single-ring law and the statistic MSR; for the
latter, we used distributed MSR data and an interpolation method.
The case study validated the effectiveness and performance of the
3D map in monitoring the status and trend of the whole system.
In particular, its robustness against bad data stands out: the 3D
map of high-dimensional data can still support estimation for
power systems even when the data of the most relevant partition
are lost.

However, some questions remain open. For example, working out
the relationship between the mean spectral energy radius (MSR)
and the physical parameters is a long-term goal. At this initial
stage, our aim is to raise open questions rather than to answer
them. In the following stage, we will realize the 3D power-map
using real data from power grids. One wonders whether this new
direction will be far-reaching in the years to come, toward the
age of Big Data.

Fig. 4: Visualization of the High-dimensional Index with Full Data Sets. Panels: (a) $t_s$ = 300 s, (b) $t_s$ = 301 s, (c) $t_s$ = 302 s, (d) $t_s$ = 420 s, (e) $t_s$ = 820 s, (f) $t_s$ = 826 s.

Fig. 5: Visualization of the Voltage V with Full Data Sets. Panels: (a) $t_s$ = 300 s, (b) $t_s$ = 301 s, (c) $t_s$ = 302 s, (d) $t_s$ = 420 s, (e) $t_s$ = 820 s, (f) $t_s$ = 826 s.

Fig. 6: Visualization of the High-dimensional Index without Data Sets of A2. Panels: (a) $t_s$ = 300 s, (b) $t_s$ = 301 s, (c) $t_s$ = 302 s, (d) $t_s$ = 420 s, (e) $t_s$ = 820 s, (f) $t_s$ = 826 s.

Fig. 7: Visualization of the Voltage V without Data Sets of A2. Panels: (a) $t_s$ = 300 s, (b) $t_s$ = 301 s, (c) $t_s$ = 302 s, (d) $t_s$ = 420 s, (e) $t_s$ = 820 s, (f) $t_s$ = 826 s.